Sequence-Level Speaker Change Detection With Difference-Based Continuous Integrate-and-Fire

نویسندگان

چکیده

Speaker change detection is an important task in multi-party interactions such as meetings and conversations. In this paper, we address the speaker from perspective of sequence transduction. Specifically, propose a novel encoder-decoder framework that directly converts input feature to identity sequence. The difference-based continuous integrate-and-fire mechanism designed support framework. It detects changes by integrating difference between encoder outputs frame-by-frame transfers segment-level embeddings according detected changes. whole supervised sequence, weaker label than precise points. experiments on AMI DIHARD-I corpora show our sequence-level method consistently outperforms strong frame-level baseline uses labels.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Change Detection based on Mean Shift

To settle out the problem that search of speaker change point (SCP) is blind and exhaustive, mean shift is proposed to seek SCP by estimating the kernel density of speech stream in this paper. It contains three steps: seeking peak points using mean shift firstly, using maximum likelihood ratio (MLR) to compute the MLR value of the peak points secondly, and seeking SCPs from MLR value using the ...

متن کامل

Conductance-Based Integrate and Fire Models

A conductance-based model of Na+ and K+ currents underlying action potential generation is introduced by simplifying the quantitative model of Hodgkin and Huxley (HH). If the time course of rate constants can be approximated by a pulse, HH equations can be solved analytically. Pulse-based (PB) models generate action potentials very similar to the HH model but are computationally faster. Unlike ...

متن کامل

Integrate and Fire Neurons

SpikeNET is a simulator for modeling large networks of asynchronously spiking neurons. It uses simple integrate-and-fire neurons which undergo step-like changes in membrane potential when synaptic inputs arrive. If a threshold is exceeded, the potential is reset and the neuron added to a list to be propagated on the next time step. Using such spike lists greatly reduces the computations associa...

متن کامل

On-line incremental speaker adaptation with automatic speaker change detection

In order to improve the performance of speech recognition systems when speakers change frequently and each of them utters a series of several sentences, a new unsupervised, online and incremental speaker adaptation technique combined with automatic detection of speaker changes is proposed. The speaker change is detected by comparing likelihoods using speaker-independent and speaker-adaptive GMM...

متن کامل

Change detection from satellite images based on optimal asymmetric thresholding the difference image

As a process to detect changes in land cover by using multi-temporal satellite images, change detection is one of the practical subjects in field of remote sensing. Any progress on this issue increase the accuracy of results as well as facilitating and accelerating the analysis of multi-temporal data and reducing the cost of producing geospatial information. In this study, an unsupervised chang...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Signal Processing Letters

سال: 2022

ISSN: ['1558-2361', '1070-9908']

DOI: https://doi.org/10.1109/lsp.2022.3185955